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Abstract. Lindstrom theorems characterize logics in terms of model-theoretic conditions 
such as Compactness and the Lowenheim-Skolem property. Most existing characterizations 
of this kind concern extensions of first-order logic. But on the other hand, many logics 
relevant to computer science are fragments or extensions of fragments of first-order logic, 
e.g., A;- variable logics and various modal logics. Finding Lindstrom theorems for these 
languages can be on coding arguments that seem to require the full expressive power of 
first-order logic. 

In this paper, we provide Lindstrom theorems for several fragments of first-order logic, 
including the fc-variable fragments for k > 2, Tarski's relation algebra, graded modal 
logic, and the binary guarded fragment. We use two different proof techniques. One is a 
modification of the original Lindstrom proof. The other involves the modal concepts of 
bisimulation, tree unraveling, and finite depth. Our results also imply semantic preserva- 
tion theorems. 



There are many ways to capture the expressive power of a logical language C. For 
instance, one can characterize C as being a model-theoretically well behaved fragment of a 
richer language £' (a preservation theorem), or as being maximally expressive while satis- 
fying certain model-theoretic properties (a Lindstrom theorem). The main contribution of 
this paper is a series of Lindstrom theorems for fragments of first-order logic. We also show 
connections between our Lindstrom theorems and preservation theorems. 

The original Lindstrom theorem for first-order logic, in one of its most widely used 
formulations, says the following: 
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Figure 1: Fragments of first-order logic 



The first-order Lindstrom Theorem [13] An extension of first-order logic satisfies Com- 
pactness and the Lowenheim-Skolem property if and only if it is no more expressive than 
first-order logic. 

There are several other versions of the theorem, characterizing first-order logic for 
instance in terms of Compactness and invariance for potential isomorphisms (the 'Karp 
Property'). Analogues of this result have been obtained for various extensions of first-order 
logic. On the other hand, few Lindstrom theorems are known for fragments of first-order 
logic. One notable example is Van Benthem's recent Lindstrom theorem for modal logic: 
The modal Lindstrom theorem [24] An extension of basic modal logic satisfies Compact- 
ness and Bisimulation Invariance if and only if it is no more expressive than basic modal 
logic. Q 

We are not aware of a similar characterization for modal logic involving the Lowenheim- 
Skolem property. Note that first-order logic itself is a compact proper extension of modal 
logic that has the latter property. 

Our primary motivation for considering fragments comes from computer science logic. 
Many logics relevant to computer science are fragments (or non-FO extensions of fragments) 
of first-order logic, for example /c-variable logics and various modal logics. Finding Lind- 
strom theorems for such languages can be a challenging problem, since most techniques used 
in the past to prove Lindstrom theorems rely on coding arguments that seem to require the 
full expressive power of first-order logic. For a recent survey of Lindstrom theorems in a gen- 
eral setting, see [ID]. Another motivation is that, by widening our domain of study to logics 
not necessarily extending first-order logic, we may come to know more about first-order 
logic itself. 

We follow two global lines of attack. First, we take the original Lindstrom theorem 
for first-order logic and generalize the proof as much as possible. In this way, we obtain 
Lindstrom theorems for the finite variable fragments FO k with k > 2 and Tarski's relation 
algebra. Next, we take the modal Lindstrom theorem as a starting point, and try to 
generalize it to richer languages. In this way, we obtain Lindstrom theorems for graded 
modal logic (on arbitrary Kripke structures and on trees) and the binary guarded fragment. 

Many open questions remain. For example, we have not been able to find Lindstrom 
theorems for the two- variable fragment or the full guarded fragment. 



By extensions of basic modal logic we mean language extensions, not axiomatic extensions. 
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2. From first-order logic downwards 

In this first part, we take the classic Lindstrom theorem as a starting point, and we 
show that the proof generalizes to certain fragments of first-order logic. 

2.1. A strengthening of the Lindstrom theorem for first-order logic over binary 
vocabularies. The first-order Lindstrom theorem uniquely characterizes first-order logic 
in terms of Compactness and the Lowenheim-Skolem property within the class of all its 
extensions. As we will show in this section, this result can be improved: first-order logic 
can be characterized in terms of Compactness and the Lowenheim-Skolem property within 
the class of all extensions of the three-variable fragment FO 3 , if we consider vocabularies 
consisting only of unary and binary relation symbols. The proof is not substantially more 
difficult than that of the original Lindstrom theorem, requiring mainly some extra care with 
coding partial isomorphisms and back-and-forth behaviour. But this simple strengthening 
will allow us to obtain a number of new results on Tarski's relation algebra, as well as finite 
variable fragments. 

To keep things simple, we will work with a fixed relational signature consisting of a set 
of unary relation symbols and a set of binary relation symbols, both countably infinite. 

By an abstract logic we mean a pair £ = (Fml/;, \=c), where Fml£ is the set of sentences 
of £ and \=c is a binary truth relation between /^-sentences and models. If no confusion 
arises, we will sometimes write £ for Fml£ and (= for \=£. We assume that /^-sentences are 
preserved under isomorphisms, and that £ has the following properties: 

► closure under Boolean connectives: for every <p G £ there is a sentence ip € £ expressing 
its negation (i.e., for all models M, M \= ip iff M <p), and for every <p,ip € £ there is a 
sentence x £ £ expressing the conjunction of <p and ip. 

► closure under renamings: for every mapping p sending relation symbols to relation sym- 
bols of the same arity, and for every sentence (p £ £, there is a sentence ip € £ such that 
for all models M, M f= ip iff p(M) \= <p. 

► closure under relativisation by unary predicates: for every sentence <p € £ and unary 
relation symbol P, there is a sentence ip £ £ such that for all models M, M \= ip iff 
M p \= <p, with M p the submodel of M induced by the subset defined by P. 

Examples of abstract logics in this sense include first-order logic (FO) and its /c-variable 
fragments (FO k ), with k > 1. 

Given two abstract logics, C and £', we say that £ extends £' (or, £' is contained in 
£, denoted by £' C £), if there is a map / : Fml/;/ — > Fml^ preserving truth in the sense 
that, for all models M and sentences <p G £', M \=c (p iff M \=c f((f>)- 

An abstract logic £ has Compactness if for every set of ^-formulas S, if every finite 
subset of £ is satisfiable then the entire set E is satisfiable. An abstract logic £ has the 
Lowenheim-Skolem property if every satisfiable set of /^-formulas has a countable model. 

First, we show that each compact extension of FO (in fact, already of FO 2 ) has the 
"finite occurrence property" . 

Lemma 2.1 (Finite occurrence property). Let £ be any abstract logic extending FO 2 that 
has Compactness. Then for any <p £ £ there is a finite set of relation symbols REL(cp) such 
that the truth of <p in any model is independent of the denotation of relation symbols outside 
REL(4>). 
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Figure 2: Model from the proof of Lemma 12, 21 

Proof. A standard argument, cf. [8]: since our vocabulary contains infinitely many unary 
and binary relation symbols, there are renamings pi,P2 whose range is disjoint. Take any (p € 
C, and let <j>\, <p2 be its renamings according to p\ and p2- Let E = {ix\ . . . Xk{pi{R){x\ . . . Xk) <-> 
p2{R)(x\ . . . Xk)) | R a /c-ary relation symbol}. Then E |= <p\ <-> 02, and hence, by Com- 
pactness, a finite subset E' C E implies </>i <-> 02- We can pick for REL{<p) the relation 
symbols occurring in £'. Note that, in case of binary vocabularies, all formulas in £ belong 
to FO 2 . □ 

The key to the proof of our improved Lindstrom theorem is the following observation: 

Lemma 2.2. Let C be any abstract logic with the Lowenheim-Skolem property and the Finite 
Occurrence Property, such that C extends FO 3 and is not contained in FO. Then "C can 
relatively projectively define finiteness" : there is a formula ij) G C with a unary predicate N 
such that, for each n £ N, there is a model of tp with exactly n elements satisfying N , while 
no model of ip has infinitely many elements satisfying N. 

Proof. The basic idea is the same as in traditional proofs of the Lindstrom theorem (e.g., 
[8]). Our main contribution is to show that, in the case of binary vocabularies, the coding 
argument requires only three variables. 

Take any (p € C not belonging to FO. Then for each k £ N, there are models 2tjt |= <p 
and 53^ \/= (p that are potentially isomorphic up to back-and- forth depth k. At the same 
time, no potentially isomorphic models can disagree on the sentence (p. We can describe 
this situation inside C. The construction is outlined in Figure [2j 

The model depicted in Figure [2] describes two models, connected via a collection of 
partial isomorphisms, that disagree on the sentence (p. The most important feature is that, 
if is an infinite set, then the collection of partial isomorphisms constitutes a potential 
isomorphism, whereas if N is finite (say, of size k), the collection of partial isomorphisms 
constitutes a potential isomorphism up to back- and- forth depth k. 
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More precisely, A and B are unary predicates denning the domains of two (sub)models, 
P is a unary predicate whose elements denote pairs from Ax B, and the elements of F rep- 
resent partial isomorphisms (i.e., sets of pairs constituting structure preserving bijections). 
The arrows represent a binary relation R. For instance, in the given example, / represents 
the partial isomorphism {(a±, b±), (ct2, 62)}) and g represents the partial isomorphism that 
extends / with the pair (03, 63). The elements of the linearly ordered set N will be used to 
associate a finite index to each partial isomorphism. 

Claim: Each of the following properties of this model can be expressed by a sentence of C: 

(1) Every p G P is associated to a pair from A x B. 

(2) Every / € F is associated to a set of elements of P that form a partial bijection between 
A and B. 

(3) Each such partial bijection preserves structure on the submodels defined by A and B, 
as far as the finitely many relations occurring in <f> are concerned (recall that C has the 
Finite Occurrence Property). 

(4) Every / £ F has an associated 'index' from N. 

(5) N is linearly ordered by R, such that there is a minimal element, and each non-maximal 
element has an immediate successor (in particular, if N is infinite, then it contains an 
infinite ascending chain). 

(6) If fRg for f,g G F, this means that g extends / (as a partial bijection), and that the 
index of g is the successor of the index of /. 

(7) The back- and- forth properties hold for partial isomorphisms whose index is not the 
maximal element of N. 

(8) Some / € F has as its index the minimal element of N. 

(9) The submodels defined by A and B disagree on <p. (Recall that C is closed under the 
Boolean connectives and relativisation by unary predicates). 

Proof of claim: The first eight properties can already be expressed in FO 3 by a clever re- 
use of variables, and the ninth property can be expressed in C by closure under the Boolean 
connectives and relativisation by unary predicates. 

For instance, the third property can be expressed as the conjunction of all F0 3 -formulas 
of the following forms, where S € REL((f>) is a binary relation symbol, and Q € REL{<f>) is 
a unary relation symbol: 

Vxy^Px A Py A 3z(Fz A Rzx A Rzy) — > (3z(Rxz A Az A 3x(Ryx A Ax A Szx)) 

<-> 3z(Rxz ABz A 3x(Ryx A Bx A Szx)) 

and 

Vx(Px -»■ (3z(Rxz A Az A Qz) <-> 3z(Rxz ABzAQz 



Note that we crucially use the fact that the signature consists of unary and binary relations 
only. □ 

Let x De the conjunction of all these ^-sentences. By the above assumptions, \ nas 
models in which TV has arbitrarily large finite cardinality (this follows from the existence, 
for each k G N, of models disagreeing on <j) that are potentially isomorphic up to back-and- 
forth depth k). However, there is no model of x m which N is an infinite set. For, suppose 
there were. Let 6 be a formula expressing that a fresh binary relation symbol S denotes a 



6 



J. VAN BENTHEM, B. TEN CATE, AND J. VAAnANEN 



linear order without maximal element on the set denoted by N, thus forcing N to denote an 
infinite set. Then xA0 would also be satisfiable, hence, by the Lowenheim-Skolem property, 
it would have a countable model. In other words, x would have a countable model in which 
iV denotes an infinite set. In countable models, being potentially isomorphic means being 
isomorphic; thus, there would be isomorphic models disagreeing on eft, which contradicts £'s 
invar iance for isomorphisms. 

In other words, x relatively projectively defines finiteness. □ 

Theorem 2.3. An abstract logic extending FO 3 is contained in FO iff it satisfies both 
Compactness and the Lowenheim-Skolem property. 

Proof. If an abstract logic is contained in FO, then, clearly, it satisfies Compactness and the 
Lowenheim-Skolem property. If, on the other hand, an abstract logic C extends FO 3 but 
is not contained in FO, then it must lack either the Lowenheim-Skolem property or Com- 
pactness. For, suppose that £ satisfies the Lowenheim-Skolem property and Compactness. 
Let ip(N) be any ^-sentence projectively defining finiteness (cf. Lemma l2.2p . and for each 
k 6 N, let Xk be an i ? 3 -formula expressing that there are at least k distinct iV-elements 
(by reusing bound variables as in 3x(Nx A 3y(Ny A Rxy A 3x{Nx A Ryx A • • • )))). Then 
every finite subset of {xk \ k £ N} U {tf)(N)} has a model while the entire set has no model. 
This contradicts the Compactness property. □ 

The following results can be proved in a similar fashion: 

Theorem 2.4. An abstract logic extending FO 3 is contained in FO iff it satisfies Com- 
pactness and invariance for potential isomorphisms. 

Proof sketch. An analogue of Lemma [2.2l can be proved where the Lowenheim-Skolem prop- 
erty is replaced by invariance for potential isomorphisms. Indeed, the proof is even shorter 
than the proof of Lemma 12.21 itself, as the last step can be dropped. Given this, we can 
proceed just as in the proof of Theorem 12.31 □ 

Theorem 2.5. A concrete abstract logic extending FO 3 is contained in FO iff it satisfies 
the Lowenheim-Skolem property and the Finite Occurrence Property, and is recursively 
enumerable for validity. 

Here, by "concrete" we mean that formulas can be coded as finite strings over some 
alphabet, in such a way that negation, conjunction, and relativisation are computable op- 
erations, and there is a computable translation from i ? 3 -formulas to formulas of the logic. 
Theorem 12.51 follows from Lemma 12.21 by closure under relativisation and the fact that 
satisfiability of FO 3 formulas on finite models is undecidable [6], hence not recursively 
enumerable. 

Note that these results all rely on our restriction to at most binary relation symbols. 
In the case with at most fe-ary relations (k > 1), analogous results hold for FO k+1 . For the 
case of k = 1, the argument is slightly different. For the sake of completeness, we give an 
outline of it below (similarly, analogues of Theorems 12.41 and 12.51 can be shown for k = 1). 
See also the Lindstrom theorem for monadic first-order logic in |21j . 

Theorem 2.6. Over unary vocabularies, an abstract logic extending FO 2 is contained in 
FO iff it satisfies both Compactness and the Lowenheim-Skolem property. 

Proof sketch. If an abstract logic is contained in FO, then, clearly, it satisfies Compactness 
and the Lowenheim-Skolem property. Conversely, let C be any abstract logic extending 
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FO 2 that is not contained in FO. Let <p £ £ be non-first-order, i.e., for every > 0, there 
are models M, N disagreeing on such that M and are potentially isomorphic up to 
depth k. 

By the same argument as in Lemma 12.11 we have that 4> depends only on finitely 
many (unary) predicates Pi, . . . ,P n . Moreover, it can be easily seen that two structures are 
potentially isomorphic up to depth k for Pi, . . . ,P n if and only if for each unary type r, 
either r is realized the same number of times in the two structures, or in both structures r 
is realized at least k times. By a unary type we mean here a function mapping P±, . . . ,P n 
to truth values. Similarly, two structures are potentially isomorphic (for unbounded depth) 
for Pi, . . . ,P n if and only if for each unary type r, either r is realized the same number of 
times in the two structures, or in both structures r is realized infinitely many times. 

Now, pick fresh unary predicates Q, Q and infinitely many fresh unary predicates P r i, for 
r a type and i > 1. The unary predicates Q, Q will be used to define two submodels, which 
will disagree on <f>, and the unary predicate P T i will be used to code a potential isomorphism. 
As we pointed out already, in order to test for the existence of a potential isomorphism it 
is enough to count the number of times each unary type is realized. The predicate P T i, 
intuitively, will denote the i-th element of type r in one of the two substructures, and they 
are used to compare the cardinalities between the two substructures. More specifically, 
consider the infinite theory £ containing the following F0 2 -sentences: 

(j) Q A ^(fP A \fx(Qx ~^Qx) 
Vx(P Ti (x) - t{x)) 

Vx(P ri (x) -> ^P Tj (x)) for i / j 

Mxy(Qx A P ri x A Qy A P ri y -> x = y) 
Vxy(^Qx A P T iX A ^Qy A P T iy — > x = y) 

3x.(Qx A P T (i + ux) — > 3x.(Qx A x.P T ix) 
3x.(^Qx A P T ti-\-i)x) — > 3x.(-iQx A P T ix) 

3x(Qx A Px A Ai<fc<i ^ p rkx) -> 3x.(Qx A P ri x) 
3x(^Qx A Px A f\i<k<i ^ p rkx) -> 3x.(^Qx A P ri x) 

3x(Qx A P T ix) «-> 3x(-iQx A P r jx) 
for r any type and i, j > 1, and where t(x) is shorthand for /\ T , P \ =1 P(x) A /\ T ^ P -j =Q ^P(x). 
The first sentence expresses that the two substructures disagree on <j). The instances of the 
2nd - 9th sentence above force the following: for all types r and natural numbers i > 0, 
a substructure (induced by Q or by —>Q) contains an element satisfying P T i if and only if 
it contains at least i elements of type r. The instances of the last sentence above, finally, 
compare the cardinalities between the two structures. It follows that the theory £ describes 
models in which the submodels induced by Q and —>Q disagree on (ft and are potentially 
isomorphic. 

Incidentally, note that if there are infinitely many elements of type r, the theory S 
cannot force each to satisfy some predicate P T i (as can be seen by an omitting types argu- 
ment), but it does force there to be infinitely many elements satisfying some predicate P T i, 
which suffices for present purposes. 
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Since, for every k > 0, there are models disagreeing on cj> such that the two are poten- 
tially isomorphic up to depth k, it can be shown that every finite subset of S is satisfiable. 
It follows that C cannot have both Compactness and Lowenheim-Skolem, for, the entire 
theory S would have a countable model, and on countable structures, potential isomor- 
phism implies isomorphism. □ 



2.2. First application: Tarski's relation algebra. Tarski's relation algebra 7ZA [IT] 
is an algebraic language in which the terms denote binary relations. It has atomic terms 
R,S,... ranging over binary relations (over some domain), constants 5 and T denoting the 
identity relation and the total relation, and operators n, — , •, - for taking the intersection, 
complement, composition and converse of relations. Thus, the syntax of 7ZA is: 

a ::= R\5\T\af]f3\a — P\a»(3\ 
with R an element from some countably infinite set of variables standing for binary relations. 
An interpretation for this language is a set X together with an assignment of binary relations 
over X to the atomic terms. In other words, it is a first-order structure for the vocabulary 
that contains the atomic terms as binary relation symbols. We write a = j3 if , in each 
interpretation, a and (3 denote the same binary relation, and we write a C /3 if, in each 
interpretation, a denotes a subrelation of the relation denoted by j3. 

In this section, we provide a Lindstrbm-theorem for extensions of relation algebra. By 
an extended relation algebra we will mean any language obtained by extending the syntax 
of relation algebra with zero or more additional logical operations, where a logical operation 
is any operation that takes as input a fixed finite number of binary relations R\, . . . ,R n 
(over some set X) and that produces a new binary relation S over the same set. We also 
require logical operations to respect isomorphisms, and to be domain independent in the 
following sense (familiar from database theory) : the output relation S only depends on part 
of the domain X that participates in at least one pair belonging to an input relation 
An example of a domain dependent operation is the (unary) absolute complement operator, 
while the (binary) relative complement operator is domain independent. Another example 
of a domain dependent operator is the zero-ary operator that returns the total relation 
in case the domain X is infinite, and the empty relation otherwise. We disallow domain 
dependent operations, as they may cause the language to lose closure under relativisation. 

Lemma 2.7 (Closure under relativisation). Let C be any extended relation algebra, let a be 
any expression of C, and let R be any relation variable. Then there is an expression of C, 
which we denote by a dom ^ such that in every interpretation M , the denotation of a dom ^ R ) 
in M is the denotation of a in M 1 , where M 1 is the submodel of M induced by the domain 
of R M . 

Proof. First, note that the expression Tr := R • T • R~~^ denote the total relation over the 
domain of R. Now, 

a dom(R) ig 

obtained by uniformly replacing every atomic expression a 
(i.e., the relational variables, as well as T and 5) by a n Tr. A straightforward formula 
induction shows that a. dom ( R > satisfies the required conditions. The inductive step for com- 
plex expressions uses the fact that all operations of C are domain independent. □ 
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One example of an extended relation algebra, TZAfo, is the extension of relation algebra 
with all first- order definable logical operations (see e.g. |25j). Another example is 7ZAT, the 
extension of relation algebra with the transitive closure operation |15] . 

The Compactness and Lowenheim-Skolem properties can be defined for extended rela- 
tion algebras as usual. For instance, we say that an extended relation algebra C has the 
Lowenheim-Skolem property if for every set of /^-expressions <&, if there is an interpreta- 
tion under which P| $ is a non-empty relation, then there is such an interpretation over a 
countable domain. As is not hard to see, 1ZA and IZAfo satisfy both Compactness and the 
Lowenheim-Skolem property, whereas 1ZAT satisfies the Lowenheim-Skolem property but 
lacks Compactness. 

The following Lindstrom-style theorem shows that all extended relation algebras con- 
taining non-first-order definable operations lack either Compactness or the Lowenheim- 
Skolem property. 

Theorem 2.8. Let C be any extended relation algebra with the Compactness and Lowenheim- 
Skolem properties. Then every logical operation of C is first-order definable. 

Proof. Lemma 12.21 can be adapted to the relation algebra setting, allowing us to show 
that every extended relation algebra containing a non-elementary logical operation and 
having the Lowenheim-Skolem property can projectively define finiteness, and hence lacks 
Compactness. We will not spell out the details, but merely mention the following key points 
of the proof: 

► For every first-order sentence <\> containing only three variables, in a signature consisting 
only of binary relations, there is a relation algebra expression a such that for every model 
M, M \= <t> iff a = T holds in M pi]. 

► Unary relations can be mimicked by binary relations by systematically intersecting them 
with the identity relation. 

► In this way, every extended relation algebra gives rise to an abstract logic extending FO 3 . 
Closure under relativisation of the logic is guaranteed by the domain independence of the 
logical operations of C. □ 

In other words, the extension of relation algebra with all elementary operations (a 
well-known structure in algebraic logic) is the greatest extension that satisfies Lowenheim- 
Skolem and Compactness. The same holds if we replace the Lowenheim-Skolem property 
by invariance for potential isomorphisms, or if we replace Compactness by recursive enu- 
merability. 

Theorem 12.81 nicely complements a known result: every extended relation algebra with 
Craig interpolation can define all first- order definable operations [18]. Together, these results 
show that IZAfo is the unique (up to expressive equivalence) extension of 1ZA satisfying 
Compactness, Lowenheim-Skolem, and Craig Interpolation. 

2.3. Second application: finite variable fragments. In this section, we provide Lind- 
strom theorems for the finite variable fragments FO k with k > 3, over vocabularies con- 
sisting of unary and binary relation symbols only. It is well known that the finite variable 
fragments can be characterized as fragments of first-order logic using potential isomorphisms 
with a restricted number of pebbles: 
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Definition 2.9. (/c-pebble potential isomorphisms) A k-pebble potential isomophism 
between M and iV is a non-empty family F of finite partial isomorphisms / between M 
and N with \dom(f)\ < k satisfying the following properties: 

► Closure under restrictions: for all / 6 F and /' C /, also /' G F. 

► Forth: for all {{w\,v\), . . . , (w n ,v n )} G F with n < k, and w G M, there is an v G iV 
such that {(wi,vx), ... , (w n ,v n ), (w,v)} G F 

► Back: for all {(w\,v\), . . . , (w n ,v n )} G F with n < k, and v £ N, there is an to 6 M 
such that {(toijUi), . . . , (io n ,v n ), (w,v)} G F 

Fact 2.10. FO k is (up to logical equivalence) the fragment of FO invariant for /c-pebble 
potential isomorphisms {k > 1). 

For a proof of Fact 12.101 see for instance pQ . 

Using Lemma \2.2\ we can turn this into the following Lindstrom characterization (re- 
member that we only consider unary and binary relation symbols): 

Theorem 2.11 (Lindstrom theorem for FO k ). Let k > 3. An abstract logic extending FO k 
satisfies Compactness and invariance for k-pebble potential isomorphisms iff it is no more 
expressive than FO k . 

Proof. Consider any abstract logic £ extending FO 5 that has Compactness and is invariant 
for A;-pebble potential isomorphisms. Then in particular it is invariant for potential isomor- 
phisms, and therefore by Theorem 12.41 it must be contained in FO. But then, by Fact 12.101 
it must be contained in FO k . □ 

Theorem 12.111 can also be seen as a strengthening of the classical preservation result 
in Fact 12.101 (for k > 3, and on binary vocabularies). Indeed, it implies (and in fact is 
equivalent to, as we will explain) the following "generalized preservation theorem" : 

Corollary 2.12. Let k > 3. Let £ be any abstract logic extending FO k with Compactness. 
Then FO k is the fragment of £ invariant for k-pebble potential isomorphisms (up to logical 
equivalence) . In particular, FO k is the fragment of FO invariant for k-pebble potential 
isomorphisms. 

Proof. Let £ be any compact abstract logic extending FO k , and let £ be the fragment 
of £ invariant for fc-pebble potential isomorphisms. Then £ satisfies all the requirements 
of abstract logics. For instance, it is closed under relativisation: if <j) G £ is invariant for 
A;-pebble potential isomorphisms, then so is its relativisation by a unary predicate. Thus, 
£ is an abstract logic extending FO k that is compact and invariant for A:-pebble potential 
isomorphisms. Hence, by Theorem 12.111 it is contained in FO k . □ 

In fact, Corollary 12.121 also implies Theorem l2.11l let L be any logic extending FO k sat- 
isfying Compactness and invariance for fc-pebble potential isomorphisms. By Corollary |2.12l 
FO k is the the fragment of C invariant for fc-pebble potential isomorphisms. But since C 
itself is invariant for fe-pebble potential isomorphisms, this means that £ and FO k coincide 
in terms of expressive power. This shows that Corollary 12.121 and Theorem 1 2 . 1 1 1 are really 
equivalent to each other: Lindstrom theorems and preservation theorems sometimes amount 
to the same thing. 
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3. From modal logic upwards 

The approach of generalizing the classic Lindstrom theorem only got us so far. It 
enabled us to characterize FO k for k > 3 but is unlikely to reach much further down, since 
coding power then falls under the minimum needed to describe partial isomorphisms and 
their extension properties. Thus, we will now take a different approach, by considering the 
modal Lindstrom theorem, and trying to generalize it to richer languages. In particular, we 
obtain two new Lindstrom theorems for the graded modal logic. 

3.1. The modal Lindstrom theorem revisited. We recall the proof of the modal Lind- 
strom theorem of [M| (which improves on an earlier result in [7]). First, we need to define 
"abstract modal logics" . As before, we assume a fixed vocabulary, now consisting of a sin- 
gle binary relation symbol R and a countably infinite set of unary relation symbols, also 
called proposition letters. Structures for this vocabulary are usually called Kripke models 
(the restriction to a single binary relation symbol is not essential but is convenient for 
presentation). We associate to each formula a class of pairs (M,w), where M is a Kripke 
model and w is an element of the domain of M. This is because modal formulas are always 
evaluated at a point in a model. We call such pairs (M, w) pointed Kripke models. Thus, an 
abstract modal logic is a pair C = (Fmlc, \=), where Fmlc is the set of formulas of C and 
|= is a binary relation between ^-formulas and pointed Kripke models. As before, when no 
confusion arises, we will write C for Fmlc- Also, as before, we assume that /^-formulas are 
invariant for isomorphisms, and that C is closed under the Boolean operations, renaming, 
and relativisation by unary predicates. 

Examples of abstract modal logics include basic modal logic, its extension with counting 
modalities called graded modal logic (GML), first-order logic (by which we mean the col- 
lection of all first-order formulas in one free variable, over the appropriate signature), and 
the modal //-calculus. For the syntax and semantics of basic modal logic, see any modal 
logic textbook (e.g., [5]). 

The modal Lindstrom theorem characterizes basic modal logic in terms of Compactness 
and invariance for bisimulations. A bisimulation between Kripke models M and N is a 
binary relation Z between the domains of M and ./V satisfying the following three conditions: 

► Atomic harmony: if wZv then w and v agree on all proposition letters (unary predi- 
cates). 

► Zig: if wZv and wR M w', there is a v' such that vR N v' and w'Zv'. 

► Zag: if wZv and vR v', there is a w' such that wR w' and w'Zv'. 

Two pointed Kripke models (M, w) and (N, v) are bisimilar if there is a bisimulation Z 
between M and N with wZv. A formula is bisimulation invariant if it does not distinguish 
bisimilar pointed Kripke models, and an abstract modal logic is bisimulation invariant if all 
its formulas are. 

Given a pointed Kripke model (M,w), we denote by M w the submodel of M contain- 
ing all points that are reachable in finitely many steps from w along the binary relation. 
Likewise, for k £ N, M k is the submodel of M containing all points reachable from w in at 

2 So far, there is nothing particularly 'modal' about these systems, and we might also speak of "abstract 
local logics", or some other appealing terminology. In particular, we do not replace isomorphism invariance 
by 'modal' invariance for bisimulations in the definition of these logics. The power of the latter condition is 
precisely one of the things we want to scrutinize in what follows. 
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most k steps along the binary relation. We say that a formula (ft is invariant for generated 
submodels if, for all models M with nodes w, (M,w) \= (ft iff (M w ,w) (= (ft. We say that 
(ft has the finite depth property if there is a k G N such that (M, iu) |= iff (M^,w) \= (ft, 
for all models M with nodes w. Clearly, the latter implies the former. Also, bisimulation 
invariance implies invariance for generated submodels, because the natural inclusion map 
is a bisimulation. An abstract modal logic C is invariant for generated submodels (or, has 
the finite depth property), if every (ft G C is invariant for generated submodels (respectively, 
has the finite depth property). 

We are now ready to proceed with the proof of the modal Lindstrom theorem, The- 
orem 13.31 below. We first prove a finite occurrence property for compact extensions of 
basic modal logic that satisfy invariance for bisimulation (in fact, invariance for generated 
submodels will do). 

Lemma 3.1 (Finite occurrence property). Let C be any abstract modal logic extending basic 
modal logic that is compact and invariant for generated submodels. Then for each (ft G C, 
there is a finite set of proposition letters PROP((ft) such that the truth of (ft in any pointed 
Kripke model is independent of the denotation of proposition letters outside PROP{(ft). 

Proof. Like that of Lemma 12.11 Since the set of proposition letters (unary predicates) is 
infinite, we can find renamings pi, P2 for them whose range is disjoint. Now, take any (ft G C, 
and let <fti,(j>2 be its renamings according to p\ and pi- Let S = {U n (p\(p) «-> P2{p)) \ n G 
N and p a proposition letter}, where stands for a sequence of n boxes. It follows from 
generated submodel invariance that £ \= (fti <-» 02, hence, by Compactness, a finite subset 
S'CE implies cft\ «-> 4>2- We can pick for PROP((ft) the set of proposition letters occurring 
in E'. □ 

Our crucial observation in this Lindstrom argument is the following: 

Lemma 3.2 ([23]). Let C be any abstract modal logic extending basic modal logic that is 
compact and invariant for generated submodels. Then C has the finite depth property. 

Proof. Let (ft G C, with p a proposition letter not occurring in (ft, and let (ft? be the relativi- 
sation of (ft by p. By the generated submo del-invar iance of C, {p, Up, UUp, . . .} |= (ft <-> cftP. 
By the compactness of C, there is an n G N such that {p, Up, . . . , U n p} \= (ft «-> (ftP. But this 
expresses exactly that (ft has the finite depth property, for depth n. □ 

Theorem 3.3 (|24J). An abstract modal logic extending basic modal logic satisfies Com- 
pactness and bisimulation invariance iff it is no more expressive than basic modal logic. 

Proof. Let C be any abstract modal logic extending basic modal logic and satisfying Com- 
pactness and bisimulation invariance. Since bisimulation invariance implies invariance for 
generated submodels, C is also invariant for generated submodels. Then, by Lemma 13.11 
and Lemma 13.21 it follows that £ has the finite occurrence property and the finite depth 
property. Next, we use the following well known fact [5]: 

Assuming a finite vocabulary, every bisimulation-invariant class of pointed 
models with the finite depth property is definable by a formula of basic 
modal logic. 

We conclude that C is contained in basic modal logic. □ 
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Theorem 13,31 can be seen as a strengthening of van Benthem's more familiar charac- 
terization of basic modal logic as the bisimulation invariant fragment of first-order logic. 
Indeed, it implies the latter theorem (as observed in [M])) but something even stronger 
holds, viz. the following "generalized preservation theorem": 

Corollary 3.4. Let C be any abstract modal logic extending basic modal logic that has 
Compactness. Then basic modal logic is the bisimulation invariant fragment of C (up to 
logical equivalence). In particular, basic modal logic is the bisimulation invariant fragment 
of first- order logic. 

Proof. Let £ be the bisimulation invariant fragment of C. Then £ satisfies all criteria 
for being an abstract modal logic. For instance it is closed under relativisation: whenever 
4> € C is invariant for bisimulations then the relativisation of eft by a unary predicate is also 
invariant for bisimulations. Likewise for the Boolean connectives. 

Hence, C! is an abstract modal logic extending basic modal logic and it is bisimulation 
invariant and Compact. Hence, it is no more expressive than basic modal logic. O 

Corollary 13.41 strengthens the traditional bisimulation preservation theorem, as there 
are compact extensions of basic modal logic not contained in first-order logic. Indeed: 

Theorem 3.5. There is an abstract modal logic extending basic modal logic that is not con- 
tained in first-order logic, but still satisfies Compactness, the Lowenheim-Skolem property, 
invariance for potential isomorphisms, and Craig interpolation. Moreover, it has a finite 
axiomatization and a PSPACE-complete satisfiability problem. 

The proof is given in the next section. Note that Theorem 13.51 is also interesting for 
another reason: it shows that Theorem 12.31 no longer holds when FO 3 is replaced by basic 
modal logic. 

By the same argument as before in the case of finite variable logics, Corollary 13.41 
not only follows from Theorem 13.31 but also implies it, and hence the two are equivalent. 
Our results resolve a question posed in [23] concerning the relationship between the modal 
Lindstrom theorem and the 'classic' bisimulation preservation theorem. 

3.2. A well-behaved non-elementary extension of modal logic. In this section, we 
prove Theorem [33J we introduce an extension of modal logic, ML', that is non-elementary 
but at the same time is very well behaved (e.g., it has the Compactness and Lowenheim- 
Skolem properties). ML' extends basic modal logic with an extra operator •. Thus, the 
formulas of ML' are given by: 

<fi ::= p | ->(/> | (f> Aip | ()4> I # 

The semantics of the newly added operator is as follows: M, w \= »<j> iff w has infinitely 
many reflexive successors satisfying <f> (a reflexive node is one related to itself). 

The reader may verify that • behaves as any normal modal operator: •(</> V ip) is 
equivalent to »<p V •t/j, and is equivalent to _L. The dual of • is denoted by •. 

Proposition 3.6. ML' is a non- elementary abstract modal logic extending basic modal 
logic. 

Proof. This is an easy verification: e.g., the language is closed under relativisation. □ 
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Proposition 3.7. ML' has the Lowenheim-Skolem property and is invariant for potential 
isomorphisms. 

Proof. By the containment of ML' in L WlW . □ 

Before we proceed to prove the remaining properties of ML', we will first provide 
an alternative semantics for the logic. Let K be the class of all bi-modal Kripke frames 
# = (W, R(>,R,) such that R, C R§ and for all (w,v) G R,, (v,v) G i?<>. Kripke models 
based on such frames can be thought of as quasi-models for ML'. The next lemma shows 
that they are adequate as such. 

Lemma 3.8. For all <\> G ML' , <\> is satisfiable according to the intended semantics iff ' <fi is 
satisfiable on the class K . 

Proof. In one direction, suppose that M, w \= 4> according to the intended semantics, where 
M = (W, R, V) is a uni-modal Kripke model. Define an equivalence relation ~ on W by 
letting v ~ v' iff v and v' assign the same truth value to all subformulas of (p. Let Rq := R 
and R, := {(v,v') G R | (v',v') G R and there are infinitely many v" ~ v' such that 
(v,v") G R and (v",v") G R}. Then the underlying Kripke frame of M' belongs to K, 
and a straightforward inductive argument shows that, for all subformulas ip of (p, and for 
all v G W, M, v \= ijj according to the intended semantics iff M' , v \= ip. In particular, 
M', w \= 4>. Incidentally, it is important for the above argument that ~ be an equivalence 
relation of finite index, so that the pigeon hole principle can be applied in the inductive 
clause for the new modality. 

Conversely, let M, w \= </>, where M = (W, i?<>, R,, V) is a bi-modal Kripke model based 
on a frame in K. We will construct a uni-modal model M' with a node w' such that 
M', w' \= <fi according to the intended semantics. Roughly, the construction of M' is based 
on the following ideas: (i) make sure that all R() \ i?,-successors of a node are irreflexive, 
by unraveling, and (ii) taking infinitely many copies of each R 9 successor of each node. 

More precisely, as the domain of M', we choose W x N. The accessibility relation R is 
the set of all pairs ((v,n), {y 1 ,m)) satisfying one of the following conditions: 

— m < 1, m 7^ n, and {v,v') G Rq; or 

— m > 2, and (v,v') G R,. 

Observe that, for all v G W, (v,0) and (v, 1) are i?-irreflexive by construction, whereas 
(v, n) for n > 2 is i?-reflexive or i?-irreflexive, depending on v. 

Finally, the valuation function V' for the atomic propositions is defined as usual, by 
letting V'{p) = {(v,n) \ v G V(p)}. Let M' = (W x N, R, V). 

A straightforward inductive argument now shows that, for all formulas ip, nodes v G W 
and natural numbers n, M, v \= ip iff M', (v, n) \= ip according to the intended semantics. 
In particular, we see that M' , (w, 0) |= 4>. □ 

In the remainder of this subsection, we use this alternative semantics. 

Proposition 3.9. A complete axiomatization for ML* can be obtained by extending any 
standard axiomatization for basic (bi-)modal logic with the following axioms: 

•4> — > 00 and •(4> — > 00) 

This axiomatization is even strongly complete: each consistent set of formulas is jointly 
satisfiable. 
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Proof. Both axioms are Sahlqvist formulas, and together, they define the frame class K. It 
follows by the Sahlqvist completeness theorem that the axiomatization is strongly complete 
for K: every consistent set of formulas is jointly satisfiable in a Kripke model based on a 
frame in K. 

It follows by Lemma 13.81 that the axiomatization is also sound and strongly complete 
with respect to the intended semantics of ML' (the proof of Lemma [3.81 actually shows that 
every set of formulas jointly satisfiable on K is jointly satisfiable according to the intended 
semantics) . □ 

It follows that ML' also satisfies Compactness. We add a few further interesting 
properties. 

Lemma 3.10. The satisfiability problem for ML' is PSPACE-comp/eie. 

Proof. The lower bound follows from the PSpace-hardness of basic modal logic. As for the 
upper bound, the standard PSpace-algorithm for basic modal logic due to Ladner can be 
extended in a straightforward manner to ML* (using the alternative semantics provided by 
Lemma I3.8P . 

Alternatively, one can also perform a direct reduction: for any ML' formula p, let p* 
be the uni- modal formula obtained by replacing each subformula of the form »ip by ()(r Aip), 
for r a (fixed) fresh proposition letter. Let Sub((j>) be the set of subformulas of cf>, and let 
depth(p) be the modal nesting depth of p>. 

It can be shown that, for any p € ML*, <fi is satisfiable on K iff 

p* A /\ U k {r Alp -> OVO 

k<depth((f>) 
ipeSub(</}) 

is a satisfiable basic modal formula. Roughly, the idea is that the additional conjuncts in 
the above formula guarantee that we can make every r-node reflexive in the model without 
affecting the truth of p. Together with Lemma [3.8l this gives us a polynomial time reduction 
from ML'-satisfiability to satisfiability of basic modal formulas. □ 

Proposition 3.11. ML' has the Craig Interpolation property: for all ML' -formulas 4>,ip, 
if H 4> * ^ then there is an ML' -formula x such that \= p> — > x> ^ X ~* ^ an d a ^ 
proposition letters occurring in x occur both in p> and in tp. 

Proof. This follows from a result in [14], according to which every multi-modal logic ax- 
iomatized by Sahlqvist formulas with first-order universal Horn correspondents has Craig 
interpolation in the sense described above (note that we do not require the modal operators 
occurring in x to occur both in <p and in ip). The two axioms from Proposition 13.91 are 
precisely of this form. □ 

Note that ML* is not invariant for bisimulations, and it does not have the finite model 
property. It would be of interest to characterize the extensions of basic modal logic satis- 
fying Compactness and the Finite Model Property. Perhaps this could still lead to some 
sort of modal counterpart to the Compactness-Lowenheim-Skolem version of the Lindstrom 
theorem for first-order logic. 
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3.3. Graded modal logic. Graded modal logic (GML) extends basic modal logic with 
counting modalities: for each formula <j) and natural number k, Ofc^ is admitted as a formula, 
and it says that at least k successors of the current node satisfy (p. 

GML-formulas are in general not invariant for bisimulations. Still, an important weaker 
invariance property does hold: GML formulas are invariant for tree unraveling. A tree model 
is a Kripke model whose underlying frame is a tree (in the graph theoretic sense, possibly 
infinite, and with a unique root). We will denote tree models by T, T', . . ., and we will use 
root(T) to denote the root of the tree model T. Every pointed Kripke model (M, w) can be 
unraveled into a tree model, by the following standard construction: 

Definition 3.12 (Tree unraveling). Given a Kripke model M = (W, R, V) and w € W, the 
tree unraveling unr(M, w) is defined as (W , R', V), where 

► W 1 consists of all finite paths (w\, u>2, • • • , w n ) satisfying w\ = w and WiRwi + \. 

► R' contains all pairs of sequences of the form ({w\, . . . , w n ), {w\, . . . , w n ,w n+ i)) £ W 1 x W' 

► (wi, . . . , w n ) E V'{p) iff w n G V(p). 

It is easily seen that, for any pointed Kripke model (M, w), unr(M, w) is indeed a tree 
model, and that {w) is its root. GML-formulas are invariant for this operation: 

Fact 3.13 (GML is invariant for tree unravelings). For all pointed Kripke models (M, w) 
and GML-formulas <fi, M,w \= 4> iff unr(M, w), (w) (= (p. 

The proof is a straightforward formula induction. 

We will prove two Lindstrom theorem for GML. The first characterizes GML on arbi- 
trary structures in terms of Compactness, the Lowenheim-Skolem property and invariance 
for tree unravelings. It can be seen as a natural generalization of Theorem 13.31 The second 
theorem, which will be proved in the following section, considers GML as a language for 
describing nodes in tree models, and it characterizes GML as being maximal with respect 
to Compactness and the Lowenheim-Skolem property on such structures. 

Theorem 13.31 characterized modal logic in terms of Compactness and bisimulation in- 
variance. One might wonder if, likewise, GML can be characterized by Compactness and 
invariance for tree unraveling. The answer is negative: the extension of GML with the 
modal operator Otti ( "uncountably many successors . . . ") still has these properties (even 
first-order logic extended with the " uncountably many" quantifier has (countable) Com- 
pactness [9]). Instead, we prove the following: 

Theorem 3.14. An abstract modal logic extending GML satisfies invariance for tree un- 
ravelings, Compactness, and the Lowenheim-Skolem property iff it is no more expressive 
than GML. 

As in the case of modal logic, we obtain the following "generalized preservation theorem" 
as a corollary (the proof is analogous to that of Corollary 13. 4p : 

Corollary 3.15. Let C be any abstract modal logic extending GML and satisfying Com- 
pactness and the Lowenheim-Skolem property. Then GML is the tree unraveling invariant 
fragment of C (up to logical equivalence). In particular, GML is the tree unraveling invari- 
ant fragment of FO . 

The rest of this section is dedicated to the proof of Theorem l3.14i Two easily established 
facts about GML will be used in the proof: 
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Figure 3: Model from the proof of Lemma 13.181 

Fact 3.16 (GML has the finite tree model property). Every satisfiable GML formula is 
satisfied at the root of some finite tree model. 

Fact 3.17. (GML can describe finite tree models up to isomorphism) Assuming 
a finite vocabulary, for every finite tree model T, there is a GML-formula ipT such that for 
every tree model T', (T', root(T')) (= ipr iff T = T. 

Fix an abstract modal logic £ extending GML and satisfying Compactness and Lowen- 
heim-Skolem, as well as invariance for tree unravelings. By Lemmas 13.11 and 13.21 £ has 
the finite occurrence property and the finite depth property (note that invariance for tree 
unraveling implies invariance for generated submodels). The following Lemma shows a kind 
of "finite width property" : 

Lemma 3.18. (Can only count successors up to a finite number) Let £ be any 

abstract modal logic extending GML and satisfying Compactness, Lowenheim-Skolem, and 
invariance for tree unravelings. Then for each formula <j) £ £ and finite tree model T, there 
is a natural number k such that "<j) can only count T -successors up to k ": whenever a tree 
model contains a node v that has k successor subtrees isomorphic to T , then adding more 
copies of T will not affect the truth value of 4> at the root. 

Proof. Since T is a finite tree model, it can be described up to isomorphism by a single 
GML-formula ipx- Let S be the following set of formulas, where p is a proposition letter 
not occurring in <p, and D n stands for a sequence of n boxes: 

{p, D n (^p -» | n e N} 

"p defines an initial subtree" 

U {U n (p -> D(->p -» Vt)) I n G N} 

"the root of every -ip-subtree satisfies t/t" 

U {U n {p A O^P -» Om(p A Vt)) I n, m G N} 

"every p-node with a -ip-successor has infinitely many p-successors satisfying ip?" 

Whenever a countable tree model satisfies S at the root, the submodel defined by p is 
isomorphic to the whole model — isomorphic in the language without p, to be precise (see 
Figure [3]). Since £ satisfies the Lowenheim-Skolem property, invariance for tree unraveling 
and invariance for isomorphisms, we can conclude that S |= eft <-> <fP . But then, by com- 
pactness, there is a k G N such that |= (f> «-» 4> p , where is the following subset of 
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S: 

{p, U n (-^p^U^p) | n G N} 
U { LT(p -» D(-.p -» ^ T )) I n G N } 

U { U n {p A <>-ip -> Om(p A Vt)) | n, m G N mtt m < k } 
This shows that the Lemma holds. □ 

Proof of Theorem \3.14\ Let C be any abstract modal logic extending GML, satisfying Com- 
pactness, the Lowenheim-Skolem property, and tree unraveling invariance. Observe that £ 
still satisfies Compactness and the Lowenheim-Skolem property if we restrict attention to 
trees (the tree unraveling of a countable model is countable). Also, by Lemmas 13. II and 13.21 
C has the finite occurrence property and the finite depth property (note that invariance for 
tree unraveling implies invariance for generated submodels). 

Consider any formula (ft G C. We will construct a set of equivalence relations ~^ for 
tree models (with i > 0), satisfying the following two properties: 

(1) T r-jf T' implies that the truth value of (ft at the root of a tree model is not affected if 
subtrees isomorphic to T at depth % are replaced by copies of T" (or vice versa) . 

(2) Each ~^ has only finitely many equivalence classes, and each is definable by a GML- 
formula. 

This then implies that (ft is equivalent to a GML formula (take the disjunction of the 
GML-formulas defining the ~q -equivalence classes that satisfy (ft). 

The claim holds trivially for i > depth(cft). Next, assume that the claim holds for 
i + 1. We will show that it also holds for i. Let K\, . . . ,K n be the (finitely many) ~ /^in- 
equivalence classes, and for each I < n, pick a finite representative G Ki (using Propo- 
sition [3716]) . It follows from Lemma 13.181 that there is a k G N such that, for all £ < n: "(/> 
can only count X^-successors up to /c", and hence, by ~^l_^-equivalence, u <p can only count 
^-successors up to k, at depth i" . But then, it follows that there are at most k n ■ 2 PROP ^ 
many ^-equivalence classes. Moreover, all these classes are explicitly definable by GML- 
formulas (in fact, by Boolean combinations of proposition letters and formulas of the form 
OmV' with m < k and tp a GML-formula defining some ~f ) _ 1 -equivalence class). 

Thus, C is not more expressive than GML on tree models. It follows that C is not more 
expressive than GML on arbitrary Kripke models: consider any £-formula (ft, and let ift be 
any GML-formula equivalent to (ft on tree models. If (ft «-» ift were falsifiable on an arbitrary 
model, then, by unraveling, it could be falsified on a tree, which, by assumption, is not the 
case. Thus, (ft and tft are equivalent on all Kripke models. □ 



3.4. Graded modal logic on trees. In this section, rather than assuming invariance 
for tree unraveling, we consider only tree models from the start. That is, we view GML 
as a language for describing nodes of tree models. From this perspective, GML has three 
distinctive limiting features: (i) when evaluated in a node, formulas can only see the subtree 
starting from that node; (ii) when evaluated at a node, each formula can only look finitely 
deeply into the subtree starting at that node; (Hi) each formula can only count successors 
up to a finite natural number (depending on the largest index occurring in the formula). 

Fact 3.19. On trees, GML has the Compactness and Lowenheim-Skolem properties. 
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Proof. This follows from the fact that GML has these properties on arbitrary structures 
(recall that the tree unraveling of a countable model is still countable). □ 

We will turn this into a Lindstrom characterization for GML on trees. However, before 
we proceed, two technical issues need to be discussed. 

The first concerns closure under relativisation. Recall from Sect. [37L1 that all abstract 
modal logics are assumed to satisfy this property. But what does it mean for a logic to 
be closed under relativisation if we consider only trees? Note that a submodel of a tree is 
not necessarily a tree. We solve this problem as follows. Given a tree model T containing 
a node n, and a unary predicate p true at n, we define Subtree(T,n,p) to be the largest 
subtree of T that contains n and contains only nodes satisfying p. Note that n is not 
necessarily the root of Subtree(T,n,p). We say that an abstract modal logic C is closed 
under relativisation on trees, if for every formula (ft G C and proposition letter p, there is a 
formula ip G L such that for all pointed tree models (T,n), (T,n) \= tp iff (T,n) \= p and 
(Subtree(T,n,p),n) \= (p. In the case of GML, we can simply pick ip to be the syntactic 
relativisation of eft by p, i.e., the formula obtained from eft Ap by replacing all subformulas 
of the form (} n tp by On(p A ip). 

Secondly, we need to make an extra assumption, namely that the extensions C we con- 
sider are closed under substitution. Intuitively, this means that C allows us to uniformly 
substitute formulas for proposition letters. More precisely, £ is closed under substitu- 
tion if for all formulas € C and proposition letters p, there is a formula x such that 
for all pointed (tree) models (M,w), (M,w) h X iff (M^MM.'OH'}] , w ) \= w here 
M [ P ^{v\(M,v)\=ip}] ig obtained from M by changing the valuation of p to {v \ (M, v) \= ip}. If 
we would not assume closure under substitution, there would be proper extensions of GML 
with Compactness and Lowenheim-Skolem. Indeed, the extension of GML with formulas of 
the form ()~p ("the current node has a parent that satisfies p") for p an atomic proposition, 
and closed under the Boolean connectives, is an example. This logic is not closed under 
substitution, as p may not be replaced by a complex formula in ~p. 

Theorem 3.20. Let C be an abstract modal logic closed under substitution and extending 
GML on trees. C satisfies Compactness and the Lowenheim-Skolem property on trees iff it 
is no more expressive than GML on trees. 

This is remarkable, since Compactness and Lowenheim-Skolem are also the character- 
izing features of first-order logic in the classic Lindstrom theorem. Note that first-order 
logic lacks Compactness on trees (due to the connectedness of trees). 

Now for the proof. Let C be an abstract modal logic satisfying the conditions of 
Theorem 13.201 Lemma 13.221 below shows that /^-formulas can only look downwards in the 
tree. 

Lemma 3.21. If C is not invariant for generated submodels, then there is a formula x G C 
containing a unary predicate p such that the following two conditions hold: 

(1) for all pointed trees (T,n), (T,n) \= x implies that n has a parent satisfying p 

(2) there is a pointed tree (T,n) satisfying x in which p is true only at the parent of n. 

Proof. Since £ is not invariant for generated submodels, there is a formula 4> G C and a 
pointed tree (T,n) such that (T, n) \= <f> and (T n ,n) ty= <p, or vice versa. Since C is closed 
under negation, we may assume w.l.o.g. that the former applies. Moreover, since C is closed 
under renamings and the set of proposition letters is infinite, we may assume that there are 



20 



J. VAN BENTHEM, B. TEN CATE, AND J. VAANANEN 



infinitely many proposition letters not occurring in (j>, in the sense that their interpretation 
does not influence truth of 4> at any state. We will refer to these proposition letters as being 
"fresh". 

Pick two distinct fresh proposition letters p, q, let (j) p and (fi q be the relativisations of <p 
to p and to q, and let £ be the following set of £-formulas: 

£ = {f,y,D fc ( P A?) | k > 0} 

Observe that £ is satisfiable: it is true at (T, n) when we make p true at all nodes, and q 
only at n and its descendants. We claim that truth of £ at a node n in a tree implies that 
n has a parent and it satisfies either p or q. For, if not then the submodels Subtree(T, n,p) 
and Subtree(T,n,q) would coincide, and hence (fP and (fi q would have to have the same 
truth value at n. 

Next, we will use Compactness to obtain a finite subset of £ that implies that the 
current node has a parent satisfying pV q. First, we 'redescribe' the situation encoded by £ 
from the perspective of the parent node. Let £' be the following set of /^-formulas, where 
r is another fresh proposition letter: 

£' = {<>(0P A -.0' A r), D(r -» D k (p A q)) | k > 0} 

By the previous observations, £' |= p V Hence, by Compactness, there is a £ G N such 
that A A r) A Afc<f 0(r ^ D k (pAq)) \= pV q. Going back to the perspective of the 
node n, if we define ip to be the formula c/jP A -^4> q A f\ k< £ D k (pAq), then t/j is satisfiable, and 
it implies the existence of a parent node satisfying pV q. To see this, suppose for the sake 
of contradiction that ip is satisfiable in a pointed tree (T' , n') and n' does not have a parent 
satisfying p V q. There are two cases. If n' has a parent (satisfying ^pA^q), we immediately 
derive a contradiction, by our earlier arguments. If n' does not have a parent, i.e., is the 
root of T", we may change the tree by adding a parent above n' satisfying —>pA —>q, without 
affecting the truth of ip at n', as follows from the definition of tp and the way we defined 
closure under relativisation. Therefore, we obtain again a contradiction. 

Finally, we take two more fresh proposition letters, s and t, and we use the fact that C 
is closed under substitution: we define x to be s A ip\p/(p A (()s — > t)),q/(q A (0s — > t))]. 

On the one hand, truth of x at a node implies it has a parent satisfying either (p A (0s —* 
t)) or A (<)s — > t)), and hence i. On the other hand, there is a pointed tree satisfying x 
in which t is only true at the parent node: take (T, n) and extend the valuation by making 
s true only at n, and t at its parent. □ 

Lemma 3.22. C is invariant for generated submodels. 

Proof. Suppose not. Let \{p) 6 £ be as described by Lemma f3 . 2 H By a "fresh renaming" 
of x we w dl mean a copy in which all proposition letters have been renamed to fresh ones, 
and which has been relativised by an additional fresh proposition letter. For the reasons 
explained in the proof of Lemma 13.211 we may assume that x(p) has infinitely many fresh 
renamings (Xi(Pi))ieN- 

Finally, we define £ to be the set of ^-formulas {xi(pi), Xlfefe)), Xi(X2(X3(P3))), • • •}• 
Every finite subset of £ is satisfiable. Indeed, a satisfying model may be constructed by 
"overlaying" different copies of the model (T,n) from Lemma 13.214 clause 2. On the other 
hand, if a node would satisfy all formulas in £ at once, its ancestors would form an infinite 
ascending chain, which contradicts the well-foundedness property of trees. Q 
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The remainder of the proof of Theorem 13,201 is along the same lines as in Sect. 13.31 
first we prove that C has the finite occurrence, finite depth and finite width properties on 
trees (using the fact that it is invariant for generated submodels), and then we derive the 
Lindstrom theorem by the same argument used in the proof of Theorem 13.141 

3.5. The guarded fragment. The guarded fragment GF forms a second extension of 
modal logic, incomparable to graded modal logic. It allows for arbitrary quantifications of 
the form 3y(G(x, y) A <f>(x, y)), where x and y are tuples of variables, and the guard G is an 
atomic formula containing the variables in x and y. The guarded fragment is decidable and 
has many 'modal' meta-properties, due to its invariance for guarded bisimulations [Tj 123]. 
see below for the definition. 

Because of its modal character, GF seems an obvious case for a Lindstrdm-style analysis 
like the one we have given for modal logic and graded modal logic. However, there are some 
technical difficulties, and we have not been able to prove an analogue of Theorem 13.31 for the 
guarded fragment yet. In this section, we focus on a special case, in fact the same special 
case as in the first half of the paper, namely for vocabularies with only unary and binary 
predicates. The Binary Guarded Fragment (G-Fbin) has the following syntax: 

4> ::= Rx | x = y | -i0 | <f) V if) \ 3y(G(x, y) A ip(x, y)) 

where R is a unary or binary atomic predicate, x and y are distinct variables, the guard G is 
an atomic formula containing both x and y (in any order) and ip contains no free variables 
besides (possibly) x and y. Note that, by this definition, unary guards such as Py, Ryy 
and y = y are not allowed, and also unguarded quantification over a single remaining free 
variable, as in 3y.cj)(y), can no longer be expressed by 3y.(y = y A 4>{y))- Thus, every 
GFb m -{oim\\\& contains at least one free variable. This prohibition on unary guards will be 
important, as it implies that the truth of a formula only depends on a local neighborhood 
of the elements assigned to the free variables. 

GFbin can be seen as an abstract logic contained in FO 2 , provided that the definition 
of abstract logics is adapted in order to take into account formulas with free variables, in 
the natural way. Below, we will assume such a generalized notion of abstract logic. When 
interpreted over Kripke models (and considering only formulas with one free variable), the 
language GFbin also constitutes an abstract modal logic extending basic modal logic. 

Guarded bisimulations admit a natural adaptation to this restricted version of the 
guarded fragment, which we call GFbin-bisimulations, defined below. 

Theorem 3.23. An abstract logic extending GF^\ n satisfies Compactness and Invariance 
for GF<o\ n -bisimulation iff it is no more expressive than GFbin ■ 

The proof is along the same lines as for modal logic and graded modal logic: using 
Compactness, we prove a finite occurrence property and a finite depth property (where 
depth is now measured as distance in the 'Gaifman graph'). We then use a tree-unraveling 
argument to show that GFbi n can express all properties invariant for guarded bisimulations 
and having the finite depth property. It is exactly in this last step that the restriction to 
unary and binary relation symbols, as well as the restrictions on the allowed guards, turn 
out to be crucial. Roughly, these restrictions allows us to relate distance in the unraveled 
tree to Gaifman distance in the original structure. 

In the remainder of this section, we present the proof in more detail. We start with 
some modal features that hold for GF in general. First, there is a natural syntactic notion 



22 



J. VAN BENTHEM, B. TEN CATE, AND J. VAANANEN 



of formula depth, whose inductive definition counts the above polyadic quantifiers as single 
steps: 

depth(Px) = 

depth(^(f)) = depth(4>) 

depth(cp V ip) = m&x(depth((f)) , depth(ijj)) 

depth(3y(G(x, y A (p(x, y))) = depth{4>) + 1 
Call a set of elements of M "guarded" if some tuple belonging to some atomic relation 
contains all these elements. We define distance for points in models (M, s), where s is a 
tuple of nodes, as follows: dist(s, s«,0) holds for Si G s, and dist(s,t,n + 1) holds if there 
is a u with dist(s, u, n) and {t, u} is guarded. Note that, by this definition, dist describes 
"distance at most" rather than "exact distance". 

We write Cut((M, s),n) for the submodel {t £ (M, s) | dist(s,t,n)} consisting of all 
points t in M lying at distance at most n from s. The following result shows that GF, like 
basic modal logic, satisfies a finite depth property, suitably defined: 

Lemma 3.24 (Distance-Depth Lemma). Let <p be any guarded formula of depth n, and 
let (N,s) be any submodel of (M, s) containing all of Cut{{M,s),n). Then (M,s) \= 4> iff 
(N, s)\=(f>. 

Next comes a generalization of modal bisimulation. A guarded bisimulation is a non- 
empty set F of finite partial isomorphisms between two models M and N which has the 
following back-and-forth conditions: given any function / : X — > Y in F, (i) for all guarded 
Z C M, there is a g G F with domain Z such that g and / agree on the intersection X (~)Z, 
(ii) for all guarded W Q N , there is a g 6 F with range W such that the inverses g -1 and 
/ _1 agree on Y n W. 

Also, we have "rooted" guarded bisimulations F running between models (M, si, . . . , s n ) 
and (N,ti, . . . ,t n ) with given initial objects, where one requires that {(sj,^) | 1 < i < n} 
is a partial isomorphism in F. By a simple inductive argument, one then shows: 

Fact 3.25. G-F-formulas are invariant for rooted guarded bisimulations: whenever there is 
a rooted guarded bisimulation between models (M, s\, . . . , s n ) and (N, t\, . . . , t n ), then for 
all GF- formulas 4>(xi, . . . , x n ), M \=cf> [si,...,s„] iff N \= <j) [t u ...,t n \. 

Andreka, van Benthem and Nemeti pQ show that, conversely, GF consists, up to logical 
equivalence, of just those first-order formulas which are invariant for guarded bisimulations. 

Another 'modal' use of guarded bisimulation in the same paper is model unraveling. 
This is like standard modal unraveling, but the construction is a bit more delicate: 

Definition 3.26 (Tree unravelings for GF). Let M be any model. A guarded path will 
be a non-empty finite sequence of guarded sets in M. We say that an element of M is 
active in a guarded path ir = {Si, . . . , S n ) if it belongs to S n and not to S n -i (or n = 1), 
and passive if it belongs to both S n and S n -\. The intuition is when an element occurs 
actively, a new copy is created in the unraveling, whereas when it occurs passively, the 
previous copy is used. Correspondingly, let = be the equivalence relation generated by 
(jri, d) ~ (-7T2, d) for 7T2 a guarded path extending iri with a single guarded set, and d passive 
in TT2- Now, the domain of the tree unraveling unrG^(M) consists of all pairs (vr,(i) where 
7T is a guarded path and d is active in ir, and the interpretation of predicate symbols Q is 
as follows. I(Q) contains the tuple {(tti, rfi), • • • , (vr^, d^)) if and only if (d±, . . . , d n ) belongs 
to the interpretation of Q in M, and there is a guarded path tt (which will in fact be a 
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common extension of 7Ti, . . . , 7r n ), such that di, . . . , dp- all belong to the final set of it, and 
(it, di) ~ (7Tj, di) for all 1 < i < n. 

For a model with parameters (M,s) this generalizes as follows. ux\yqf{M, s) has paths 
7r all starting from the initial set s, but then continuing with guarded sets only. The objects 
(tt, d) are defined as before. 

The point here is that the set F of all restrictions of the finite maps sending (7Tj, di) to 
di for all guarded finite domains in unrc^(M, s) is a rooted guarded bisimulation between 
{M, s) and unr gf(M, s). Checking the zigzag conditions for the bisimulation will reveal the 
reason for the above definition of the predicate interpretations I(Q). For other, essentially 
equivalent formulations of this notion, cf. [TJ [11] . 

Now we have the generalities in place for our Lindstrdm Theorem, but it remains to 
make some adjustments. Firstly, the notion of guarded bisimulation needs to be slightly 
adjusted. GF^- m -bisimulations are defined like guarded bisimulations, except that the back- 
and- forth conditions are only required to hold when XPiZ ^ and Y (~)W ^ $, respectively. 
It can be shown that GFb in is precisely the Gi^in-bisimulation invariant fragment of FO 
(over vocabularies consisting of unary and binary relation symbols only). 

Secondly, in the definition of tree unravelings, we make one simple change for the binary 
case: 

The finite paths of guarded sets always introduce one new object at each 

stage. At each step, one takes a new object related to that new object. 
This allows paths starting with object a and then continuing with Rab, Qcb, . . . , while 
ruling out paths like Rab, Qac. But the final atom is not omitted from the unraveled model, 
since one can have paths starting with a and then placing Qac immediately. Thus, even 
with these restricted paths, we still have a GFbi n -bisimulation between tree unravelings and 
their original models. The real point of this adjustment is the following observation: 

The definition of predicates for path objects makes binary relations hold only between 
objects (ni,di), (^2,^2) where 1T2 is a one-step continuation of the path m, or vice versa. 
But then, counting distance as before, 

The new object at the end of a path of length k lies at distance k from the 

initial object of the path. 
Put in more vivid terms, l tree distance is true distance 1 in the original model. This is 
a non-obvious fact. E.g., with ternary guards Rayz, objects at the end of a path may keep 
links to the initial object a which might recur at any finite depth in the guarded sets along 
the path@ Having a direct correspondence between the length of the path and the distance 
in the tree unraveling is essential, as it will allow us to establish a finite depth property 
analogous to Lemma 13.21 for the basic modal language. 

We will write unr GFbin (M) and unrcj? bin (M, s) for this new type of unraveling. Now 
everything is in place for our final argument: 

Proof of Theorem \3. 231 Let £ be any abstract logic extending GF^ m and satisfying Com- 
pactness and invariance for GFbin-bisimulations. As before, it suffices to show that every 
formula (j) S C is invariant for models that are equivalent for all G-Fbin-formulas up to some 
finite depth n. 

First, largely as in the earlier modal proof of Sect. 13.11 we use the Compactness of L, 
together with its Relativization Closure, to show that <p must have the Finite Occurrence 



'This observation is due to Martin Otto, p.c. to the authors. 
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Property and a Finite Distance Property for some level n. Before, universal prefix formulas 
O k p (for all finite k) made sure that p holds in the generated submodel at the current node. 
This time, one uses all nested sequences of universal guarded quantifiers up to depth k, 
requiring that some new predicate P holds for all objects reached at the end. The n thus 
found for the local depth of the formula (j) is the same n as needed for the following semantic 
invar iance: 

Given the above unraveling construction and invariance for Gi^bin-bisimulation for C, 
we may then assume, without loss of generality, that we have the following situation: 

(a) unr GFbin (M,s) \= cp 

(b) unrGF bin (M, s) and unrGF bin (N, t) satisfy the same GFbi n -formulas up to depth n 

Our aim is to show that (unrQF bjn (N, t)) \= 4>. 

We start by cutting the tree models to depth n, as before in our modal argument, 
obtaining Cut(umGF bm (M, s), n) and Cut(unrGF bm (N, t), n). Since tree depth is true depth, 
this does not change truth values of 4> in either model. 

Then we define a G-Fbin-bisimulation between these models. First of all, note that all 
elements in the first one are of the form (({s}, {d\, cfo}, • • • , {dk~i, dk}}, dk) with d £ s and 
< k < n) where each set in the sequence is guarded. Likewise for the second model. 
Secondly, note that the only guarded subsets in these models are singleton sets and sets of 
the form {(ir, d), ((ir, {d, e}), e)}. Now, let the relation Z be the set consisting of ({§}, {t}), 
as well as all sets of pairs 

{({(vr, d), (tt', d% (((vr, {d, e}>, e), ((vr', {d 1 , e'}),e'))} 

with 7r, 7r' sequences of some length k < n for which it holds that, for alH < k, the guarded 
sets 7Tj and 7r- satisfy the same Gi^-formulas of quantifier depth n — i. 

It can be shown (the argument involves a case distinction) that Z is a GFbin-bisimulation 
between Cut(umGF bm (M, s), n) and Cut(unrGF bm (N, i), n). Hence, the truth of <f> transfers 
from Cut(unrGF bm (M, s), n) to Cut(unrGF bm (N, i), n), and hence to (N,i). □ 

As indicated, this argument hits a barrier when ternary predicate are allowed, and 
hence we leave a similar Lindstrom characterization of the full guarded fragment GF as an 
open problem. Recently [16J, a Lindstrom-style characterization was obtained for the full 
GF along slightly different lines, by means of adding a further model-theoretic property, 
cf. the discussion below. 

4. Discussion 

To conclude, we identify a few lines of research suggested by our results. 

4.1. Charting the landscape of first-order fragments. We have seen how many frag- 
ments of first-order logic have Lindstrom characterizations, but with very different proof 
techniques. 'Top-down', we showed how careful coding can get the original proof down to 
cover the 3-variable fragment, while 'bottom up', we gave a modal technique which lifts to 
various richer modal-like extensions. 

There are more positive results than we have presented here: e.g., our coding tech- 
nique can easily be adapted to the bounded fragment of first-order logic (cf. [IH]), showing 
that every abstract logic extending the bounded fragment and having Compactness and the 
Lowenheim-Skolem property is contained in FO, and hence also the bounded fragment is 
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maximal with respect to Compactness, the Lowenheim-Skolem property, and invariance for 
generated submodels (a restriction to binary vocabularies is not necessary here). 

However, there is also a 'gap' between the two proof techniques, and logics like the 
2- variable fragment, or modal logic with an added 'universal modality' ("true in all nodes, 
accessible or not") pose a challenge. We may need new proof techniques here, or indeed, 
a rethinking of what a Lindstrom theorem should be in such cases. Recently, Otto and 
Piro [TB] have characterized modal logic extended with the universal modality as being a 
maximal system C with respect to Compactness, invariance for global bisimulations and 
also, the Tarski union property (i.e., preservation of ^-sentences under limits of chains 
of £-elementary embeddings), and similarly for the guarded fragment. Thus, additional 
preservation properties may be needed. 

4.2. Re-positioning first-order logic in this broader setting. Lindstrom's theorem in 
its classical formulation characterizes first-order logic only within the class of its extensions. 
A natural question to ask in our wider setting is within which broader classes of languages we 
can characterize first-order logic. Our Theorem 12.31 gave a partial answer: it characterized 
first-order logic within the class of extensions of FO s . This at once suggests many new 
questions. For instance, are there extensions of FO 2 not contained in FO that satisfy 
Compactness and the Lowenheim-Skolem property? @ 

4.3. Down and then up again: adding fixed-point operators. It is well-known that 
fragments of first-order logic can behave very differently from the full system when it 
comes to adding transitive closure, or even complete fixed-point extensions. In particu- 
lar, the modal ^-calculus remains decidable, just like the basic modal logic, whereas the 
logic LFP(FO) extending first-order logic with arbitrary fixed-point operators becomes 
not recursively enumerable, and indeed IT} complete. While fixed-point logics of both sorts 
are natural from a computational point of view, they have resisted Lindstrom-style analysis 
so far. Here is one question, out of many which suggest themselves following our earlier 
results: 

Question 4.1. Can the modal //-calculus be characterized in terms of bisimulation invari- 
ance and the finite model property? 

4.4. Characterizing logics on specific classes of structures. No Lindstrom character- 
izations are known for first-order logic on finite structures, or on trees. Compactness fails 
for first-order logic on such structures, and, on finite structures, the Lowenheim-Skolem 
property becomes meaningless. In this paper, we proved one positive result: we showed 
that GML behaves on trees as first-order logic does on arbitrary structures: it is maximal 
with respect to Compactness and the Lowenheim-Skolem property. In general, however, 
this area remains underexplored: [12] raise the issue of the 'missing Lindstrom theorem' 
for finite model theory. Here we state just one related technical question. Given a logic 
C interpreted on finite structures and a logic £ interpreted on arbitrary structures (both 

4 Theorem l3.5l gave an extension of modal logic not contained in first-order logic which has these properties. 
^This formulation naturally arises in our setting since so many of our modal arguments involved finite 
reachability of nodes from the origin, a typical non-first-order fixed-point notion. 
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satisfying the usual conditions such as closure under the Boolean operations) , let us say that 
£ is a conservative extension of C if C is simply the restriction of £' to finite structures. 

Question 4.2. Are there extensions of first-order logic on finite structures that have a con- 
servative extension to infinite structures satisfying Compactness and Lowenheim-Skolem? 

4.5. Relating different characterizations: the case of interpolation. Finally, taking 
stock of our results, there is also the more elusive, but still interesting question of "what 
is a Lindstrom theorem?". As we already indicated in the introduction, there are many 
natural ways to characterize the expressive power of a logic, including Lindstrom theorems, 
preservation theorems, and characterizations involving Craig interpolation. 

And even the Lindstrom theorems in this paper fall into different classes. Some use 
Compactness and Lowenheim-Skolem, or some other size-restricting principle of model ex- 
istence, while others use Compactness and Invariance for Potential Isomorphisms, or some 
other semantic transfer property of the language. While these formulations both capture 
first-order logic, we have seen that they diverge for basic modal logic, where we have a char- 
acterization of the latter type but not of the former. The precise extent of this phenomenon 
remains to be understood. 

While the picture of 'natural properties' is not quite clear yet, it is noticable that Lind- 
strom theorems seem to go hand in hand with interpolation results in abstract model theory 
and related areas. Among the proper extensions of FO the main examples of interpolation 
are L WlW , its countable admissible fragments La, and second order logic SO. Barwise [3] 
gives a maximality characterization of L WlUJ and its countable admissible fragments La in 
terms of a (generalized) recursion theoretic criterion called strict absoluteness. The proof 
uses interpolation^ Also second-order logic has a maximality characterization: SO is the 
maximal extension of FO in which every definable model class has a "flat" definition in set 
theory [22]. Again this result is intimately connected with the (trivial) way in which SO 
satisfies (single-sorted) interpolation. Finally, the interpolation theorem of L^^ for entail- 
ment along potential isomorphism in [2] is related to a maximality characterization of this 
logic: Lqo^ is the maximal extension of FO which is bounded and has the Karp Property 
[4]. We feel that this link between Lindstrom theorems and interpolation theorems requires 
further analysis. 

Moreover, this connection makes sense for our study of fragments as well. Some of 
our results used interpolation properties already, and in particular, our paradigmatic modal 
logics have them. Indeed, among fragments, even further types of interpolation property 
emerge, which fail for first-order logic. Thus, the two major modal logics for which we 
managed to obtain Lindstrom theorems, namely basic modal logic and graded modal logic, 
both possess uniform interpolation, a strong form of Craig interpolation where the inter- 
polant can be chosen uniformly for all consequents sharing some specified vocabulary with 
the antecedent. We would like to understand the impact of this condition on abstract logics 
in conjunction with our Lindstrom-style analysis in this paper. 



It is arguable whether this characterization of L bJlbJ (or La) should be considered a Lindstrom type 
results as strict absoluteness is not a model theoretic condition. However the boundary between generalized 
recursion theory and infinitary model theory is not very strict. 
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